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Abstract 

We prove that any non-amenable Cayley graph admits a factor of IID 
perfect matching. We also show that any connected d-regular vertex tran- 
sitive graph admits a perfect matching. The two results together imply 
that every Cayley graph admits an invariant random perfect matching. 

A key step in the proof is a result on graphings that also applies to 
finite graphs. The finite version says that for any partial matching of 
a finite regular graph that is a good expander, one can always find an 
augmenting path whose length is poly-logarithmic in one over the ratio of 
unmatched vertices. 



1 Introduction 

Let r be a finitely generated group, and G a locally finite Cayley graph of T. 
An invariant random subgraph on G is a probability distribution on the set of 
subgraphs of G that is invariant under the natural action of Y on G. 

A factor of IID is a particular way of defining an invariant random subgraph. 
We only sketch the definition here. First each vertex gets a random number in 
[0,1], independently and uniformly. Then each vertex makes a deterministic 
decision on how the subgraph looks like in its neighborhood, based on what it 
sees from itself as center. Since each vertex uses the same rule, the distribution 
of the resulting subgraph is automatically invariant under the action of Y. 

Instead of subgraphs, one can also define vertex colorings, or more general 
structures on G. The general name for such a random process is a factor of 
IID process. An important feature is that such a process can automatically be 
modeled on any good finite model of G. For instance, any factor of IID process 
on a regular tree can be modeled (with small error) on finite regular graphs with 
large girth. 
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Invariant random processes, and in particular factor of IIDs on Cayley graphs 
have received considerable attention recently. Standard percolations are triv- 
ially factor of IID processes, as well as the free and the wired minimal span- 
ning forests. Another example is the recent solution of the measurable von 
Neumann's problem by Gaboriau and Lyons (see [1]). They show that every 
non-amenable Cayley graph admits a factor of iid 4-regular tree. 

It is a long standing open problem to determine the maximum density i(G) 
of a factor of IID independent subset of a regular tree (mentioned e.g. on the 
webpage of David AldousQ). The exact value is unknown, though it is known to 
be less than 0.46. Note that trees are bipartite and thus have independent sets 
of density 1/2, but the resulting process can not be a factor of IID. The related 
open question is to determine the limit of the ratio i(G(n, d)) of the largest 
independent subset in n vertex d-regular random finite graphs, as n goes to 
infinity. Bayati, Gamarnik, and Tetali in [T] have recently shown that the limit 
exists, and the above mentioned modeling phenomenon shows that its value is 
at least i{Td) where Td is the ci-regular infinite tree. A conjecture of Balazs 
Szegedy (see Conjecture 7.13 in [5]) claims that this limit is in fact equal to 
i{T d ). 

In this paper we settle the analogous question for the maximum density of 
independent edge sets in non-amenable Cayley graphs. An independent edge 
set in a graph is usually referred to as a matching. An obvious upper bound 
on the density of a matching is that of the perfect matching, i.e. where every 
vertex is covered by an edge. We show that in our case one can actually achieve 
the maximum possible density, that is, one can construct a perfect matching as 
a factor of IID. 

Theorem 1.1. Let V be a finitely generated non- amenable group with finite 
symmetric generating set S. Let G — Cay(Y, S) denote the associated Cay- 
ley graph. Then there is a factor of IID on G that is almost surely a perfect 
matching. 

This extends the result of Lyons and Nazarov [7] who proved the same 
statement for bipartite non-amenable Cayley graphs. 

In particular, every non-amenable Cayley graph admits an invariant random 
perfect matching, which was also not known. Jointly with Abert and Terpai, 
the authors showed the following theorem. 

Theorem 1.2. Every infinite vertex transitive graph G has a perfect matching. 

Abert and Terpai kindly suggested to include the result in this paper. Now, 
following an observation of Conley, Kechris, and Tucker-Drob ([5]) this implies 
that every amenable Cayley graph admits an invariant random perfect matching. 
Thus, together with Theorem ll.il we get the following. 

Corollary 1.3. Every Cayley graph admits an invariant random perfect match- 
ing. 

J http: //www. stat .berkeley . edu/~aldous/Research/OP/inv- tree .html 
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The basic strategy of the proof of Theorem 1 1.1 1 is similar to what Lyons and 
Nazarov use to prove the bipartite case, and what has been used by Elek and 
Lippner [3] to construct almost-maximal matchings. We define a sequence of 
partial matchings, each of which is obtained from the previous one by flipping 
a sequence of augmenting paths. To show that this sequence "converges" to 
a limit perfect matching, one has to show that edges do not change roles too 
often. The crucial step is to bound the length of the shortest augmenting path 
in terms of the ratio of unmatched vertices. 

Our main contribution is establishing this bound for general graphs. When 
applying the result on finite graphs, we get the following theorem, that is of 
independent interest in computer science. 

Theorem 1.4. For any cq > and al > 3 integer, there is a constant c — c(co, d) 
that satisfies the following statement. If a partial matching in a cq- expander 
d-regular graph leaves at least e ratio of all vertices unmatched, there is an 
augmenting path of length at most clog 3 (I / e) , or there is a set of vertices H C G 
such that \H | > 3, \H\ is odd, and the number of edges leaving H is at most d. 

Remark 1.5. The theorem remains true even if there are only two unmatched 
vertices. This may be surprising at first, but in fact the condition that any odd 
set H has at least d edges leaving it easily implies the conditions of Tutte's 
theorem, so such graphs always have perfect matchings. 

In the bipartite case, such a bound has actually already been observed in [5] 
by Jerrum and Vazirani, who used it to give a sub-exponential approximation 
scheme for the permanent. They remark in the same paper that a similar 
bound for general graphs would be desirable, as it would lead to an approxi- 
mation scheme for the number of perfect matchings for arbitrary graphs. In 
a subsequent paper we shall work out the details of this application, together 
with a generalization of Theorem ll.4l to non- regular graphs. 

The outline of the paper is as follows. In Section [2] we show the existence of 
perfect matchings in vertex transitive graphs. In Section [3] we prove that in a 
non-amenable Cayley graph there is a factor of IID that is a perfect matching, 
modulo a variant of Theorem 11.41 whose proof we postpone to Section 0J 

Acknowledgements. We are indebted to Miklos Abert for introducing the 
problem to us and for his constant encouragement. We would also like to thank 
him and Tamas Terpai for valuable discussions and their kind permission to 
include the proof of Theorem II .21 in this paper. 

Both authors' research is partially supported by the MTA Renyi "Lendulet" 
Groups and Graphs Research Group. Gabor Lippner is further supported by 
AFOSR grant FA9550-09-1-0090-DOD-35-CAP. 

1.1 Notation and definitions 

Let G be a simple graph, either finite or infinite. The vertex and edge set of G 
will be denoted by V(G) and E(G) respectively. 
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Definition 1.6. A matching in G is a subset M C E(G) such that any vertex 
x is adjacent to at most one edge e € M. We will denote by V(M) the set of 
vertices that are matched, i.e. that are adjacent to an edge in M. A matching 
is perfect if V(M) = V(G). 

Definition 1.7. Given a graph G with a matching M, an alternating path is a 
path xqXi . . . Xk in G such that every second edge belongs to M. An alternating 
path is called an augmenting path if its first and last vertices are not matched. 

If x, y € V{G) are unmatched vertices and p is an augmenting path connect- 
ing x and y, then we can define a new matching M' = M(p) — Mo E(p) as the 
symmetric difference of the old matching M and the set of edges of p. The new 
matching will then satisfy V(M') = V(M) U {x,y}. 

Let (X, fi) be a standard Borel probability measure space with a non-atomic 
probability measure \x. 

Definition 1.8. A graphing on A is a graph Q such that V(Q) = X, and 
where G{E) C X x X is a symmetric measurable subset, such that if A, B C X 
are measurable subsets and / : A — > B a measurable bijection whose graph 
{{x, f{x)) : x e A} is a subset of E(Q), then fi(A) = fi(B). 

There is a natural way to measure the size of edge sets in a graphing. If an 
edge set is given by a measurable bijection / : A — > B as before, then the size of 
this edge set is defined to be fi(A). This extends to a measure on all measurable 
edge sets. In particular this implies that if H is a sub graphing of Q then the 
size of the edge set of H can be computed by the formula 



A measurable matching (or matching for short) in Q is a measurable subset 
M C E(Q) such that every vertex is adjacent to at most one edge in M . A 
matching is almost everywhere perfect if [i(V(G) \ V{M)) = 0. In this paper we 
will only be interested in almost everywhere perfect matchings, and will refer 
to them as perfect matchings for short. 

A graphing Q is a c -expander if for every measurable set H C V(Q) we have 
\E(H,V(g) \ H)\ > c Q \H\\V(g) \ H\, where E(A, B) denotes the set of edges 
having one endpoint in A and one endpoint in B. 

Let T be a finitely generated group, and S C T a finite symmetric generating 
set, and G = CayiT, S) the associated Cayley graph, that is g e V is connected 
to gs for every s G S. T acts on itself by left multiplication, and this naturally 
extends to a left action on X = [0,l] y(G) = [0,l] r by gx{^) = x{g~ 1 ^). The 
latter action is called the Bernoulli shift of L. We can equip X with a probability 
measure \x which is the product of the Lebesgue measure in each coordinate. It 
is easy to see that the Bernoulli shift action is measure preserving. 

L also naturally acts from the left on Y = {0,1} B ' G ^ whose elements can 
be considered as subsets of E(G). We can also equip Y with the product of 
uniform measures on the coordinates. 




(1) 
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Definition 1.9. In our context a factor of IID is a measurable, T equivariant 
map (f> : X — > Y. 

Definition 1.10. The graphing Q associated to the Bernoulli shift and S is 
given by Q{V) = X and Q(E) = {(x,y) € X x X : 3s € S.s" 1 ^) = y}- The 
connected component of almost any point a; 6 X is isomorphic to the Cayley 
graph G. 

Claim 1.11. There is a one-to-one correspondence between measurable subsets 
F C E{G) and factors </> : X -> Y. 

Proof. Let F C E{Q) be a measurable subset and / : E(Q) — »■ {0, 1} its charac- 
teristic function. Define <£p- : X —> Y by the following formula. 

<t>F(x)(g,gs) = f(s~ 1 g~ 1 x,g- 1 x). 

Then 

{hcj) F (x)){g,gs) = (f) F (x)(h' l g,h~ 1 gs) = f{s~ 1 g~ 1 hx,g- 1 hx) = cj> F (hx)(g, gs), 

so we do get a factor. 

Conversely, given a factor <j> one can define a subset C by choosing 

the edge s _1 a;, a; to be part of if and only if <j>(x)(id, s) = 1. □ 

Remark 1.12. 

• From this construction it is clear that _F is an almost everywhere perfect 
matching if and only if is a factor of IID perfect matching. 

• There is an entirely analogous correspondence between measurable subsets 
of V(Q) and factors : X —> {0 , l} r . In Lemma 2.3 of [7] then translates 
into the fact that if the Cayley graph G is non-amenable then there is a 
Co > depending only on the expansion of G, such that the graphing Q 
associated to the Bernoulli shift is a co-expander. 

2 Perfect matchings in vertex transitive graphs 

Let G(V, E) be an infinite, connected, c?-regular, vertex transitive graph. In this 
section we show that G has a perfect matching. The proof is done in three steps. 

Definition 2.1. A cut is a partition of V into a nonempty finite set A and its 
complement A c — V \A. The size of the cut is the number of edges between A 
and its complement. A best cut is a cut with minimum size. 

Lemma 2.2. Suppose A, B C V are different finite subsets defining best cuts. 
Then each of the sets A\B, B\A, AuB, and AD B is either empty or defines 
a best cut. 
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Proof. Let X = A \ B, Y = B \ A, Z = A n B, W = V \ (A U B). Then 



\E(X,X C )\ + \E(Y,Y C )\ = 

= 2\E(X,Y)\ + \E{X,Z)\ + \E{X,W)\ + \E(Y,Z)\ + \E(Y,W)\ < 
< 2\E(X, Y)\ + \E(X, Z)\ + \E(X, W)\ + \E(Y, Z)\ + \E(Y, W)\ +2\E(Z, W)\ = 
= \E{X U Z, Y U W)\ + \E{Y U Z, X U W)\ = \E(A, A c )\ + \E(B, B c )\ 

This shows that the cuts defined by X and Y are together at most twice the 
size of the best cut, hence they must be best cuts as well. (Or empty sets.) A 
similar argument works for Z and W (or rather X U Y U Z, since that is the 
finite set) as well. □ 

Lemma 2.3. The size of the best cut in G is d. 

Proof. Let X be a smallest finite set that defines a best cut. For any pair of 
vertices i,i/el there is an automorphism of G that takes x to y. Let Y be the 
image of X under this automorphism. Then clearly Y also defines a best cut, 
hence X \ Y is also a best cut. But \X \ Y\ < \X\ contradicting the minimality 
of X, unless X = Y. Hence the graph spanned by X is vertex transitive. If 
\X\ < d, then the number of edges leaving X is at least |X|(<f — \X\ + 1) > d and 
we are done. If \X\ > d, then since G is connected, there is an edge between 
a vertex x £ X and V \ X. But then by vertex transitivity of X, there is 
such an edge from every single vertex of X, giving the desired lower bound 
\E{X,X C )\ > \X\ > d. □ 

Corollary 2.4. Since the number of edges leaving any finite set Y is at most 
d\Y\, and the number of edges entering any finite set X is at least d, we get that 
the number of finite of components of G\Y is at most \Y\. 

Now we are ready to show the existence of perfect matchings in infinite 
vertex transitive graphs. 

Proof of Theorem By compactness it is sufficient to show that any finite 
subset X <zV can be covered by a matching in G. So assume for contradiction 
that there is no matching in G that covers a given finite set X. 

Let us construct an auxiliary finite graph G'(V',E') as follows. Let V = 
X U dX U M where dX is the outer vertex boundary of X and M is a non-empty 
set of new vertices such that \V'\ is even. We define the edge set E' to contain 
all original edges spanned by X U dX, furthermore we add all edges in dX U M 
to make it a clique. 

If G' has a perfect matching, then just keeping those edges of the matching 
that intersect X gives a matching in G that covers X. So we can assume that 
G' does not have a perfect matching. Then by Tutte's theorem there is a set 
Y C V such that the number of odd components of G' \ Y is greater than \Y\. 
But since \V'\ is even, we actually get that the number of odd components of 
G' \Y is at least \Y\+2. 
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The vertices of dX U M are always in a single component. Thus we can 
assume that Y is disjoint from M, since removing vertices of M from Y affects 
at most one component while reducing the size of Y . Then Y can be thought of 
a subset of V, and it is easy to see that any finite component of G' \ Y is also a 
finite component of G\Y, except perhaps for the one component containing M. 
Still, this means that G\Y has at least \Y\ + 1 odd components, all of which 
are finite, contradicting the previous Corollary. □ 

Later we will need a slight strengthening of Lemma 12.31 We say that a real 
cut is a cut where the finite set has at least 2 elements. 

Lemma 2.5. The size of the smallest real cut is bigger than d, unless every 
vertex of G is in a unique d-clique. 

Proof. Suppose the size of the smallest real cut is d, and let X be a smallest 
finite set that defines a smallest real cut. It is clear that \X\ > 2 since a set 
of size 2 defines a cut of size at least 2d — 2 > d. As before, let x, y S X and 
let Y be the image of X under an automorphism taking x to y. We are going 
to distinguish between three cases according to the size of X \ Y, which is the 
same as the size of Y \ X. 

If they have more than 1 element each, then they also real cuts and hence 
by Lemma T2.3l thev are also smallest real cuts, contradicting the minimality of 
X. 

If they are both of size 1, then \X D Y\ and \X U Y\ both have to be bigger 
than 1, hence they are also smallest real cuts, again contradicting the minimality 
of X. 

Thus \X \ Y\ = 0, hence X — Y, so just like in the proof of Lemma \2. 31 we 
get that X itself is vertex transitive. Thus, by connectivity, each vertex of X 
has an edge leaving X. Thus if \X\ > d + I then we are done. So \X\ < d and 
thus the number of edges leaving X is at least \X\(d— |X| + 1). This is strictly 
greater than d, unless \X\ = 1 or X is a clique of size d. The first is clearly 
not the case since X is a real cut. Thus X is a e?-clique. Then, of course, by 
transitivity every vertex of G is in a d-cliquc. 

Finally, it is not possible that a vertex is contained in more than one d-clique. 
If two different d-cliques A and B intersect then by degree of the vertices in the 
intersection we see that \A fl B\ = d — 1. Let {a} = A\B and {b} = B\A. If a 
and b would be neighbors then the graph would not be connected. Thus a has 
to have one neighbor c outside of B. But c cannot be connected to vertices in 
A nB, so A is the only d-clique that contains a. But by transitivity each vertex 
has to be contained in the same number of d-cliques, contradicting our setup. 
Thus two different d-cliques cannot intersect. □ 

Corollary 2.6. If the size of the smallest real cut in G is exactly d then there 
is a perfect matching in G that is invariant under the automorphism group of 
G. This matching is given by choosing the unique edge from each vertex that 
leaves the d-clique the vertex is contained in. 
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3 Factor of iid perfect matchings via Borel 
graphs — the proof of Theorem 11.11 

Let r be a finitely generated non-amenable group, S a finite symmetric gen- 
erating set of size \S\ = d, and G the associated Cayley graph. We want to 
construct a factor of IID perfect matching in G. 

If the size of the smallest real cut in G is equal to d, then by Corollary 12.61 
there is a fixed perfect matching in G that is invariant under the action of 
the automorphism group, and each vertex can decide which edge to choose by 
observing its own 1-neighborhood, so this is clearly a factor of IID matching 
and we are done. 

Thus we can assume that the smallest real cut in G is at least of size d + 1 . 
Let Q be the graphing associated to the Bernoulli shift, as in Definition ll.101 By 
Claim [TTTT1 and Remark 11.121 it follows that Q is a co-expander for some Co > 
depending only on G. Hence Q is admissible in the sense of Definition 14.11 

By Remark ll.l2l it is now sufficient to prove that Q has an almost everywhere 
perfect matching. Proposition 1.1 in [3] shows that there exists a sequence of 
matchings Mq, M\,M<i, ■ ■ ■ C Q such that a) there are no augmenting paths of 
length 2k + 1 in Mf, and b) each M k is obtained from M^-i by a sequence of 
flipping augmenting paths of length at most 2k + 1. We would like to construct 
an almost everywhere perfect matching as a limit of the M k s. In order to do 
this, we have to show that, except for a measure zero set, the status of any 
edge changes only finitely many times during the process, so we can take a 
"pointwise" limit of the sequence to obtain a matching that covers but a zero 
measure subset of X . 

Let us denote by the set of unmatched vertices in Mf.. Then in the process 
of getting Mfc+i from M k we are flipping augmenting paths starting and ending 
in Uk- Furthermore each vertex of Uk can be only used once as an endpoint of 
an augmenting path, since after that it becomes a matched vertex. Any edge 
that changes status between Mk and M k +i has to be part of an augmenting 
path at least once. Thus the total measure of status changing edges in this step 
is at most (2k + 3)\Uk\- If we can show that ^2 k {2k + 3)\Uk\ < oo then by the 
Borel-Cantclli lemma the measure of edges that change status infinitely many 
times is zero, and we are done. 

We have seen that Q is admissible. Let e — \Ut\- Then by Theorem 14.21 there 
is a constant c = c{c.Q,d) depending only on the expansion of Q and the degree 
d, such that there is an augmenting path of length at most clog 3 (l/e) in M^. 
But by definition we know that this has to be longer than 2k + 1. Thus we get 
2k + 1 < clog 3 (l/e) or equivalently 

This is clearly small enough to guarantee that ^2 k {2k + 3)|J7j.| < oo and thus 
complete the proof of Theorem 11.11 □ 



8 



Corollary 3.1. Every d-regular infinite Cayley graph has an invariant random 
perfect matching. 

Proof. For amenable graphs Conley, Kechris and Tucker-Drob observed in 
Proposition 7.5 of [2] that Theorem 1 1 . 21 implies the existence of invariant random 
matchings. 

Since a factor of IID perfect matching is automatically an invariant random 
perfect matching, Theorem 11.11 completes the non-amenable case. □ 

4 Short alternating paths in expanders 

Let G(X, E) be a d-regular graphing, or a connected, d-regular graph that can 
either be finite or infinite. We are going to treat these three cases at the same 
time. When it is necessary to point out differences, we will refer to them as the 
measurable / finite / countable case respectively. 

Definition 4.1. We say that G is admissible if it is a co-expander, and the 
smallest real cut into odd sets has size at least d+1 (in the sense of Lemma |2TS"|) . 

The following theorem includes the statement of Theorcm ll.4l and the variant 
about graphings that is needed for the proof of Theorem 11.11 

Theorem 4.2. For any co > and d > 3 integer, there is a constant c = c(co, d) 
that satisfies the following statement. Given any admissible measurable ( or large 
finite) graph, and a partial matching with at least e measure (or fraction) of 
unmatched vertices, there is an augmenting path of length at most clog 3 (l/e). 

Though our main goal is to prove theorems about measurable graphs and 
finite graphs, we are going to need auxiliary results about infinite, connected 
(i-regular graphs as well. Since the three cases can be handled the same way, we 
are going to present the proofs at the same time, pointing out differences when 
necessary. In the measurable case, everything will be assumed to be measurable, 
unless explicitly stated otherwise. If A, B C X then E(A, B) will denote the set 
of edges that have one endpoint in A and the other in B. In the measurable case 
the measure of the set A will be denoted by \A\. In the finite case \A\ is going 
to denote the size of A divided by the total number of vertices in the graph. So 
in both of these cases < \A\ < 1. In fact, a finite graph can be considered as 
a graphing with an atomic probability measure. However in the countable case 
\A\ is going to simply denote the size of A. Similarly with edge sets, in the finite 
and the measurable cases \E(A,B)\ will denote the measure of the edge set as 
defined by the integral ([I]) in Definition 11.81 In the countable case \E(A,B)\ 
will just denote the size of the set E(A, B). If we really want to talk about the 
actual size of sets in the finite case, we will denote it by \\A\\ and B)\\ 
respectively. 

Let M C E be a matching. Then V(M) C X shall denote the set of matched 
vertices. Let S C X \ V(M) denote a fixed subset of the unmatched vertices 
and let F = X \ (V(M) U S) denote the remaining unmatched vertices. We are 
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going to construct alternating paths starting from S in the hope of finding an 
alternating path connecting two unmatched vertices. Such an alternating path 
is called an augmenting path. 

4.1 Sketch of the proof 

First we give an outline, pointing out the main ideas without introducing the 
technical definitions. We encourage the reader to read the whole outline before 
reading the proof, and also to refer back to it whenever necessary. Without 
understanding the basic outline, many technical definitions will likely be rather 
unmotivated. 

1. We start from a set of unmatched vertices S. Assuming there are no short 
augmenting paths, we would like to show that the set of vertices (X n ) 
accessible via n-step or shorter alternating paths grows rapidly, eventually 
exceeding the size of the whole graph, leading to a contradiction. 

2. It will be necessary to keep track of matched vertices accessible via odd 
paths (head vertices), even paths (tail vertices), or both. In notation 
X n =SUH n UT n UB n . 

3. If there are plenty of edges leaving X n from T n or B n , then the other 
ends of these edges will be part of X n +i, fueling the desired growth. The 
first observation is that if this is not the case, then there has to be many 
tail-tail or tail-both edges. 

4. A tail vertex that has another tail- or both-type neighbor will normally 
become a both-type vertex in the next step. In this case even though X n 
does not grow, the set B n grows within X n , still maintaining the desired 
expansion that eventually leads to a contradiction. 

5. The problem is that certain tail- vertices will not become both- type even 
though they possess a both-type neighbor. These will be called the tough 
vertices. The bulk of the proof is about bounding the number of tough 
vertices. The key idea here is that we can associate to each tough vertex 
x a distinct subset of B n called the family of x. Families associated to 
different vertices are pairwise disjoint. (This is done in Section I4.3H 

6. There can not be too many tough vertices with large families. On the 
other hand if a vertex stays tough for an extended amount of time, its 
family has to grow. These two observation together should be enough to 
bound the number of tough vertices. 

7. The proof proceeds in two rounds from this point. First, if X n is smaller 
than half of the graph, then already families larger than Ad(d + l)/co are 
too large, and indeed vertices can't be tough too long before they reach 
this critical family size. Then all the previous observations are valid and 
X n grows exponentially as desired. (This is the contents of Theorem 14.51 
and the proof is done in Section l4~4l ) 
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8. In the second round, when X n is already quite big, this unfortunately 
does not work anymore. The bound after which families can be deemed 
too large grows as \X\ X n \ shrinks, and thus vertices can be tough longer 
and longer before their families become big enough. At this point it be- 
comes necessary to show that the families of tough vertices also grow 
exponentially fast. 

9. In Section 14.51 we demonstrate that the dynamics of how a family grows 
is almost identical to how the sets X n are growing. In fact families are 
more or less what can be reached from the tough vertex by an alternating 
path. But a family lives within an infinite countable graph, hence it is 
never bigger than "half of the graph", so only the first round is needed 
to show exponential growth. Hence Theorem 14.51 has a double gain. It 
proves the first round for X n , but at the same time it is used to prove fast 
family growth in the second round. 

10. Once we have established exponential family growth, an approach very 
similar to the proof of the first round is used to complete Theorem 14.21 in 
Section 14.61 The proofs of both rounds employ a method of defining an 
invariant whose growth is controlled. But the hidden motivation behind 
the invariant is what we have outlined in this sketch: if X n doesn't grow 
then B n grows. If B n doesn't grow either then there have to be many 
tough vertices. If there are many tough vertices then they have to be 
tough for a long time. But then their families have to become too big. 
Finally there is no space for all these big families. 

11. Unfortunately there is a final twist. When analyzing family growth in 
Section 14.51 we have to introduce certain forbidden edges in each step, 
through which alternating paths are not allowed to pass momentarily. 
Hence, to be able to use Theorem 14.51 in this more general scenario, we 
need to state it in a rather awkward way. Instead of saying that X n is just 
what can be reached by alternating paths of length at most n, we need to 
use a recursive definition of X n taking into account the forbidden edges in 
each step. But as it is pointed out in Remark 14.41 if one chooses to have 
no forbidden edges, X n just becomes what it was in this sketch. 

The proof is organized as follows. In Section 14.21 we introduce the basic 
recursive construction of the X^ sets using the notion of forbidden edges. We 
state the key Theorem l4.5l that on one hand provides the proof of the first round, 
and on the other hand will be used to show exponential family growth. 

Tough vertices and families are introduced in Section FOl together with proofs 
of their basic properties. Then Theorem 14.51 and the first round is proved in 
Section [4. 4[ using the invariant-technique. 

In Scction [4.5l wc show how the growth of a family can be modeled using the 
forbidden edge construction, and prove exponential growth of families. Finally 
in Section f4.6l we finish the proof of the second round, again using the invariant- 
technique. 
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4.2 Forbidden edges 

We are going to use the following terminology. All alternating paths will 
start with an unmatched edge, but may end with either kind of edges. If 
P = (PO)Pii • • • iPi) is an alternating path of length \p\ = I, then the vertices 
with odd index will be referred to as the "head" vertices of p and the even index 
vertices (except for p ) will be called "tail" vertices, p will be called even if I is 
even, and odd if I is odd. The last vertex will be denoted by end(p) = pi. When 
this doesn't cause confusion, we will also use p to denote just the set of vertices 
of the path. 

Definition 4.3. Assume that for every k we are given a subset of "forbid- 
den" edges E k C E. Using this as input data, we shall recursively construct a 
sequence of vertex sets 



Suppose we have already defined X k . Then Xk+i is defined as follows. Take a 
matched edge vw outside of Xk- We are going to include these two vertices in 
Xk+i if and only if there is an even alternating path starting in S whose length 
is at most 2k + 2, whose last two vertices are v and w in some order while all 
the previous vertices are in Xk, and, most importantly, the edge on which it 
leaves X k does not belong to E k . 

Remark 4.4. This definition implies that each X k consists of matched pairs, 
and for any vertex v 6 X k there is an alternating path p C X k such that po e S, 
\p\ < 2k, and end(p) = v. If the E k are all empty, then X k consists of all vertices 
accessible from S via an alternating path of length at most 2k. First we will 
show that the size of X k grows fast. 

Theorem 4.5. Suppose that 



2. there are no augmenting paths of length at most 2n — 1 starting in S, and 

3. \E k \ < d\S\ for allQ<k<n, 

4- the number of non-forbidden edges leaving X k is at least l/(d+ 1) portion 
of all edges leaving X k for all k < n. 



Note that the first condition is always satisfied in the countable case, since 
X n is always finite. 

We will need a more refined classification of the vertices in X n . First of all 
let 2l Q denote the set of all odd alternating paths starting from S, and 2l e the 



S = I„cIiCl 2 c... 



1 



\x n \ < \X\X n \ 



Then 
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set of all even alternating paths. For every n > 1 let us define the following 
subsets of X n . Let 

H n = {x G X n : Bp G a o (l < |p| <2n,pc A„;cnd(p) = a;)}, 

f n = {xeX n :3pe 2l e (2 < |p| < 2n,p C X„; end(p) = x)}, 

T n T n \ H n , 
B n = H n n T n . 

It is important that in these definitions we are not insisting that the paths 
avoid forbidden edges at any time. The forbidden edges only limit the definition 
of X n , but then we want to consider all possible alternating paths within the 
set. 

The last three are the set of head vertices, the set of tail vertices, and those 
that can be both heads or tails. It is clear that S and T n are disjoint. As long as 
there are no augmenting paths of length at most In — 1, then S is also disjoint 
from H ni and thus X n is a disjoint union of S, H n , B n , and T n . It follows from 
the definition that B\ C Bi C . . . , furthermore M gives a perfect matching 
between T n and H n , and also within B n . (Note that this implies \H n \ — |T„|.) 

The rough idea of why X n should grow fast is this. By expansion, even 
in the presence of forbidden edges, there are plenty of edges leaving X n . Any 
edge leaving X n from T n adds to the size of X n+ i directly. Only edges leaving 
from H n cause problems. But since H n and T n have the same total degree, 
any surplus of edges leaving H n have to be compensated by edges within T n or 
between B n and T n . Such edges will contribute to the growth of B n within X n , 
and thus implicitly to the growth of X n . 



4.3 Combinatorics of alternating paths 

In this section we will be mainly concerned about how edges within T n L) S and 
between B n and T„ U S contribute to the growth of B n . 

Lemma 4.6. If x,y G T n U S and xy G E then either x G B n+ \ or y G B n+ \ 
or there is an augmenting path of length at most 2n + 1 . 

Proof. It is sufficient to prove that either x or y would be in H n +i- Let p, 
respectively q be shortest alternating paths that witness x and y E T n respec- 
tively. We may assume without loss of generality that \p\ < \q\. Then y cannot 
lie on p, otherwise there would either be a shorter alternating path witnessing 
y G T n , or we would have y G H n and not in T n U S. Hence adding the xy edge 
to p we obtain an alternating path of length at most 2n + 1 that witnesses that 

y G Hn+l- □ 

Edges running between T n U S and B n are more complicated to handle. If 
b G B n and t G T n U S, but all paths witnessing b G T n run through t, then we 
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can't simply exhibit t G H n+ \ by adding the bt edge to the end of such a path 
since it would become self-intersecting. The following definition captures this 
behavior. 

Definition 4.7. 

• A vertex x G T n U S is "tough" if it is adjacent to one or more vertices in 
B n , but x g H n+1 . 

• An edge xy G E is "tough" if x G T n U S, y G B n and x is a tough vertex. 

TT n will denote the set of vertices that are tough at time n. 

We would like to somehow bound the number of tough vertices. In order 
to do so, we will associate certain subsets of X n to each tough vertex in a way 
that subsets belonging to different tough vertices do not intersect. Then we will 
show that these subsets become large quickly. 

Remark 4.8. We think of n as some sort of time variable, and all the sets evolve 
as n changes. Usually n will denote the "current" moment in this process. In 
the following definitions of age, descendent, and family, there will be a hidden 
dependence on n. When talking about the age or the family of a vertex, we 
always implicitly understand that it is taken at the current moment. 

Definition 4.9. The "age" of a vertex x G TT n is a(x) = n — min{/c : x G 
T k US}. 

Definition 4.10. Fix a vertex x G TT n . A set D C X n has the "descendent 
property" with respect to x if the following is true. For every y G D there are 
two alternating paths p and q starting in x and ending in y, such that 

• both start with an unmatched edge, but p is odd while q is even, 

• P ,qcDU{x}, 

• \ P \ + \q\ < 2a(x) + 1. 

Sets satisfying the descendent property with respect to x are closed under 
union. 

Definition 4.11. The "family" of a vertex x G TT n is the largest set D C X n 
that satisfies the descendent property. In other words it is the union of all sets 
that satisfy the descendent property. The family of x is denoted by F n (x). 

Claim 4.12. If x G TT n and xy is a tough edge then y is in the family of x. 
In particular every tough vertex has a nonempty family. 

Proof. Let p be a path that witnesses y G T n . Now if p appended by the edge 
yx would be a path then it would witness x G H n +i- Since this is not the case, 
x has to lie on p. Let D denote the set of vertices p visits after leaving x. For 
any point z G D there are two alternating paths from x to z. One is given by p 
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and the other by going from x to y and then walking backwards on p. Suppose 
x = P21 and y = p2k- The total length of these two paths is 2k — 21 + 1. Since the 
age of x by Definition is at least k—l we see that 2k— 21+1 < 2a(x) + l. Hence 
the two paths satisfy all conditions of Definition 14.101 so D has the descendent 
property with respect to x. Hence by Definition 14. 1 II a; has a non-empty family, 
in particular y is in the family. □ 

Claim 4.13. The family of any tough vertex is a subset of B n . 

Proof. Let x 6 TT n be a tough vertex and let s be a shortest path witnessing 
x 6 T n U S. Let us denote \s\ — 2k. It is enough to show that the family of x is 
disjoint from s. Indeed, then for any point y in the family we can take the two 
types of paths p, q as in Definition 14.101 from x to y. By the age requirement 
in Definition l47T0l we get that \p\ + \q\ < 2a(x) + 1 = 2n - 2k + 1. Hence 
\s\ + \p\ + \q\ < 2n + 1 and thus \s\ + \p\ < 2n — 1 and |s| + \q\ < 2n. Since these 
paths run within the family which is disjoint from s, we can append s with 
p and q respectively to get alternating paths witnessing y G H n and y G T n 
respectively. 

Now suppose the family of x is not disjoint from s. It is clear that any family 
consists of pairs of matched vertices. Let i be the smallest index such that the 
pair S2i-i> S2i is in the family. Then from x there is an odd alternating path p 
to S2i by Definition 14.101 that runs within the family and its length is at most 
2a(x) + 1 < 2n — 2k + 1. Since i was the smallest such index, the path p is 
disjoint from so, s%, . . . S2i-i- Thus by appending sq, s±, . . . , S2i by the reverse 
of p we get an alternating path from an unmatched point to x ending in an 
unmatched edge, whose length is at most 2n — 2k + 1 + 2i < 2n + 1. This path 
witnesses x G H n+ i, contradicting the toughness of x. □ 

Next we will prove that any vertex can belong to at most one family. We 
start with a simple lemma about concatenating alternating paths. 

Lemma 4.14. Let p be an even alternating path from x to y and q an odd 
alternating path from y to z. Then there is an odd alternating path from x to 
either y or z whose length is at most \p\ + \q\. 

Proof. If the concatenation of p and q is a path, then we are done. Otherwise 
let i be the smallest index such that pi G q. Let pi = qj. Then po,p\, . . . ,pi — 
1j> Qj+ii ■ • • 1 end(g) is a path from x to z and po,Pi, ■ ■ ■ ,Pi = <lj, Qj-i> ■ ■ ■ <7o is a 
path from x to y. Both have length at most \p\ + \q\, both of them end with 
non-matched edges and one of them is clearly alternating. □ 

Claim 4.15. Two families cannot intersect. 

Proof. Let x, y G TT n be two tough vertices. Assume their families F and G do 
intersect. Let p, q be shortest alternating paths witnessing x,y G T n U S. Let 
us choose the shortest among all alternating paths from x to F D G that runs 
within F. Let this path be p' and its endpoint x' G F n G. Do the same thing 
with y to get a path q' from y to y' G F n G lying within G. By symmetry we 
may assume that \p\ + \p'\ < \q\ + \q'\. 
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By the choice of p' we see that the only point on p' that is in G is its cndpoint 
x'. From x' there are two paths, s and t, leading to y within G by Definition ^. 101 
one of which, say s, can be appended to p' to get an alternating path from x to 
y. This path p'Us clearly starts and ends with a non-matching edge. 

Now we are in a situation to apply the previous lemma, p leads from po to 
x and ends with a matching edge, and p'Us leads from x to y and starts and 
ends with non-matching edges. Thus by the lemma, there is an alternating path 
from po to either x or y which ends with a non-matching edge. The length of 
this alternating path is at most |p| + \p'\ + \s\. But by the choice of p', the choice 
of q' , and by the age requirement in Definition 14.101 we have 

\p\ + \p'\ + \a\ < \q\ + \q'\ + \s\ < \q\ + \t\ + \s\ < \q\ + 2a(y) + 1 = 2n + 1. 

Thus the alternating path we have found from po to x or y has length at most 
2n + 1 so it witnesses x £ H n+ \ or y £ H n+ \. But neither is possible since both 
x and y are tough, which is a contradiction. □ 

Corollary 4.16. There is exactly one tough vertex adjacent to any family. 

Proof. Let x,y £ TT n and z £ F n {x). Suppose there is an edge between y and 
z.Then z is in B n , hence yz is a tough edge, hence z is in the family of y, but 
then the two families would not be disjoint, which is a contradiction. □ 

Let c\ = Ci(cq,oI) be a constant to be determined later. 

Claim 4.17. Suppose \F n (x)\ < c±, v £ F n (x), and there is an edge vw such 
thatw £ B n \F n (x) . Then either w £ F n+Cl (x) or x £ H n+Cl . In other words, if 
a vertex remains tough for an extended period of time, then its family consumes 
its neighbors. 

Proof. We can assume that x ^ H n+Cl since otherwise we are done. Thus x is 
still tough at the moment n + c\ . 

First suppose there is a path p £ 2l e , \p\ < 2n that ends in w and does not 
pass through x. Let w' £ p be the first even vertex on the path that is adjacent 
to some vertex v' £ F n (x). Then the initial segment of p up until w' has to be 
disjoint from F n (x). By definition, in F n (x) there has to be an alternating path 
from x to v' that ends in a matched edge. Extending this path through w' and 
then the initial segment of p, we get an alternating path from S to x. Its length 
is obviously at most \p\ + c\, hence x £ H n+Cl / 2 and consequently in H n+Cl , 
and this is a contradiction. 

That means that any even path from S to w of length at most 2n has to pass 
through x. Let p be the shortest such path. Let v' be the last vertex of p that is 
in F n (x) U {x}. The vertex v' divides p into two segments, p\ going from 5* to v 1 
andp2 from v 1 tow. Then \p 2 \ = \p\ — \pi\ < 2n — 2min{fc : x £ T^US} = 2a(x), 
and equality can only happen if x — v' . We claim that pi becomes part of the 
family at time n + c%. For any vertex y £ p 2 we can either go from x to v' 
in even steps and then continue along p 2 , or go from x to v in even steps and 
continue backwards on p 2 to y. The total length of these two paths is at most 
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c i + \p%\ + 1 + c i < 2(a(x) + ci) + 1. Since at moment n + c\ the age of x is 
exactly a(x) + ci, the set F„(a;) Up2 will satisfy the descendent property so this 
whole set, including w, will be part of F n+Cl (x). □ 

Definition 4.18. We will say that at moment n the family of the vertex x £ 
TT n is expanding if there is an edge vw such that v S F n (x) and w S B n \F n (x). 
For any a; G X, let e„(a;) be the number of moments m < n such that < 
|-Fm(a;)| < ci and at moment m the family was expanding. 

Corollary 4.19. For any x € X we have e n {x) < c\ independently of n. 

Proof. By Claim 14.171 we know that the number of moments in which an ex- 
panding family has a fixed size k < c\ is at most C\. This is because after the 
first such moment, in c\ time the family either ceases to exist or strictly grows. 
Thus for each possible size k there are at most c\ moments of expansion, and 
thus there are at most c\ such moments in all. □ 

4.4 Invariants of growth 

Now we are ready to start the proof of Theorem 14.51 Let 
J(n) = \X n \ + \B n \ + - I e n (x)dx. 
or in the infinite connected case 

I(n) = \X n \ + \B n \ + ^Y1 e "^)- 

Proposition 4.20. Suppose that 

1. \X n \ < \X\X n \, 

2. there are no augmenting paths of length at most In — 1 in X n , and 

3. the number of forbidden edges is \Ek\ < d\S\ for all < k < n, 

4- the number of non-forbidden edges leaving X^ is at least l/(d+ 1) portion 
of all edges leaving Xk for all k < n. 

then 

Proof. In the following we shall omit the index n from all our notation, except 
where it would lead to confusion. Let TT denote the set of tough and TM the 
set of not-tough vertices within TU S. The tough vertices are further classified 
according to their families. TB denotes the tough vertices whose families have 
size at least c\. For tough vertices with smaller families, TE shall denote the 
ones that have expanding families and TG denote the rest. So 

S U T = TM U TT = TM U (TB UTEU TG) . 
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First let's take a tough vertex x € TG whose family is small and not ex- 
panding. Let \E(x, F(x))\ — k. By the assumption on the size of the smallest 
real odd cut we know that the number of edges leaving x U F(x) (which is a 
set of odd size!) is at least d + 1. But only d — k of these are adjacent to x, so 
at least k + 1 have to be adjacent to F(x). None of these edges can lead to B 
because this is a non-expanding family. Also non of these edges can lead to TT 
by Corollary 14.161 Hence all these edges have to go to H, TM, or the outside 
world O = X c . This means that 

\E{F{x),TG)\ = \E(F(x),x)\ < \E(F(x), H U TM U 0)\. (2) 

By Claim 14.121 we see that any edge between TG and B has to run between a 
vertex in TG and a member of its family. Thus integrating @ over x G TG 
and using that families are pairwise disjoint subsets in B we get that 

\E{B,TG)\ < \E(B, H U TM U 0)\ 

For any other tough vertex we bound the number of edges between it and B 
by the trivial bound d. Adding this to the previous equation we get 

\E(B,TT)\ < d\TB\+d\TE\ + \E{B, H U TM U 0)\ (3) 

We know that \T\ — \H\ because of the matching, so the total degrees of 
S U T is d\S\ more than the total degree of H. The edges between T U S and H 
contribute equally to these total degrees. In the worst case there are no internal 
edges in H . This boils down to the following estimate. 

\E{H,0)\ + \E{H,B)\+d\S\ < 

< 2\E(T U S,T U S)\ + \E(TUS, 0)\ + \E(TM, B)\ + \E{TT,B)\. 

Combining it with ([3]), and subtracting \E(H, B)\ from both sides we get 

\E(H,0)\ +d\S\ < 2\E(TU S,TU S)\+2\E(B,TM)\ + 

+ \E{B U T U S, 0)| + d\TB\ + d\TE\. 

Any vertex in TB has a family of size at least c\ , and all these are disjoint by 
Claim I4TT51 and contained in B. Thus we get that \TB\ < \B\/ci. Using this 
and adding \E(B L)T L) S,0)\ to both sides implies 

\E(X n ,0)\+d\S\ < 2\E(TU S,TU S) +2\E(B,TM)\ + 

+ 2\E{BUTUS,0)\ + —\B\+d\TE\. (4) 

Cl 

Any vertex in O n that is adjacent to B n U T n U S along an edge not in the 
forbidden set E n is going to be in X n+ i, hence 

\E(B n UT n US, O n ) \ E n \ < d(\X n+1 \ - \X n \). 
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By definition, any vertex in TM n that is adjacent to an edge coming from B n 
will be part of B n+ i or yield an augmenting path. Also, by Lemma 14.61 any 
edge in E(Sl)T n , SuT n ) has to be adjacent to a point in |-B„+i \ \B n \ or yield 
an augmenting path. This implies that 

2\E(T,T)\ + 2\E(B,TM)\ < 2d{\B n+1 \ -\B n \). 

By the 3rd assumption of the proposition we have \E n \ < d\S\. Plugging all this 
into (01 we get 

\E(X n ,0 )\E n \ ^ + + + \B„[ 

a c\ 

By Definition 14.181 for any vertex x £ TE n we get e n+ i(x) — e n (x) + 1, and 
thus 



e n+ i(x)dx — / e n (x)dx + \TE\. 
x Jx 

Hence the right hand side of ([5]) is exactly 2(I(n + 1) — I(n)) + \B n \/ci. Fur- 
thermore by the 4th and 1st assumptions of the proposition we have 

\E(X n ,O n )\E n \ > > co\X n \(l-\X n \) 

"' ~ d+1 ~ d+l ~ 2d + 2 l n| 

in the measurable case and 

\E(X n ,O n )\E n \ > |£;( f"'°" )l > > -?-\X n \ 

v ' ,x d+l - d+l ~ 2d + 2 l 1 

in the connected infinite case. So in either case we get 

"' \X n \-^<I(n + l)-I(n) 



4d(d+l)' 1 2ci 

Now we can complete the proof of the proposition. First, choose c\ = Ad{d- 
l)/c . Then, since \B n \ < \X n we get 

|Xn| <7(n + l)-/(n). 
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On the other hand, we know from Corollary 14.191 that e n (x) < c\. Obviously 
e n (x) — if x £ O n - Thus J x e n (x)dx < cf\X n \. Hence 

I(n) < (2 + -j^j \X n \ < cj\X n \ < 2c?(/(n + 1) - J(n)). 

Substituting ci = 4d(d + l)/c finishes the proof. □ 

This proposition implies that I(n) grows exponentially fast. But as we have 
seen, \X n \ can be bounded from below in terms of I{n). This will imply fast 
growth of \X n \ too. 
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Proof of Theorem \4-5\ 

Since S C Xq, we have \S\ < 1(0). Then again by Corollary 14.191 we have 
I (n) < c\\X n \. So by Proposition 

Substituting c\ — Ad(d + l)/c we get the desired result. □ 

In the measurable case, when X n becomes large, the method apparently 
breaks down. The main problem is that expansion guarantees only co|A„|(l — 
\X n \) edges between X n and O n . When X n is large, the 1 — \X n \ term will be 
the dominant. It was crucial to choose c\ so that the \B n \/ci terms becomes 
comparable to the lower bound coming from expansion. But for large B n , hence 
small 1 — \X n \, this cannot be done with a constant c\. The smallest c\ that has 
a chance to work is roughly on the scale of 1/e. But then the upper bound for 
I(n) becomes (1/e) 3 and all of a sudden the time needed for X n to exceed 1 — e 
becomes super-linear in 1/e instead of the desired poly-logarithmic dependence. 

This loss of time comes from the part where we argued that any family grows 
bigger than c\ in c\ time. This observation was sufficient for a constant c\, but 
is clearly insufficient when c\ ~ 1/e. In this part we will show that, in fact, 
families grow much faster than what Claim 14.171 asserts. It turns out that in a 
sense families grow exponentially, hence it takes much less time than (1/e) 2 to 
reach a size of 1/e. This will allow us to "fix" the argument in Section l4~4l 

4.5 Family business 

In this section we shall examine in detail the lifecycle of a family. Let us fix a 
vertex x G X. At some no, this x may become an element of T„ . Then later it 
may start to have neighbors in B ni (for a larger value n\ > uq). At this point it 
can become tough and start to have a family. This family grows in time, until 
at some even larger value of n the vertex finally becomes part of B n . We want 
to understand the part when x becomes tough and its family starts growing. 
To this end we shall recursively define a sequence of " special moments" 

"-o < « i < n 2 < n 3 . . . 

and an increasing sequence of sets 

= FX C FXi C FX 2 C FX 3 . . . 

that control how fast the family grows. The definition is rather complicated, 
so we present it step-by-step, along with the notation. For any n.; < n < n^+i 
we write c(n) = i, and think of it as a counter. The sets FXk are going to be 
defined such that the following hold: 

1. FXk is the union of some matched pairs of vertices. 
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2. For any matched edge vw C FXk there is an alternating path p that starts 
in x, lies entirely in FXk, ends with the matched edge (in either direction) 
and has length at most 2k. 

3. fl c („) C F n (x) holds for all n when x is tough, as shown on this scheme 
of evolution: 

FX a =$ FXiCF ni (a) FX 2 CF n2 (x) FX 3 CF n3 (x) 

no > n\ > ri2 > n$ > 114 .. . 

Suppose we have already fixed rik and FXk- 

Definition 4.21. Let rrik denote the smallest moment rrik > nu in which there 
are at most d edges leaving FXk U {x} that do not end in B„ lk \ FXk- Let this 
set of edges be denoted by Ek- Now define FXk+i to be the extension of FXk 
by those matched edges in B mis that can be the last edge of an alternating path 
of length at most 2k starting from x, and lying entirely in FXk except for its 
last two vertices. 

FXk+i = FXk U{d£ B mk : Bp e 2t e (|p| < 2k + 2; Po = x; 

Pl,...,P2k G FX k 5 P2fe+1 = V or p 2 k+2 = v} (6) 

It is clear that this construction satisfies the first two conditions stated just 
above Definition ^. 21[ but there is no reason for FXk+i to be a subset of F mk (x). 
However, if we choose rik+i — m/. + 2k then the following claim implies that the 
third condition will be also satisfied. 

Claim 4.22. While x is tough, FXk+i C F mk+ 2k{x) for all k, hence FX c r n -\ C 
F n (x) for all n. 

Proof. This is very similar to Claim 14.171 We argue by induction on k. Then 
we can assume that FXk C F„ lk . We need to show that FX^+i C F mk+2 k+i- 
Take a matched edge vw C FXk+i \ FXk- By definition there is an alternating 
path p of length at most 2k + 2 starting in x, ending in the vw edge, and lying 
in FXk- Suppose its last vertex is w. Since vw C B mil there has to be a path 
q proving this, ending in the same edge, but in the opposite order: wv. Let's 
take the shortest such path. It has to pass through x, otherwise x would not be 
tough at n = rrik + k. Denote the part of this path between x and v by q. Now 
we have two paths from x. The path p ends with vw while the path q ends with 
wv. The length of p is at most 2k + 2, the length of q is at most 2a(x). We will 
show that some subset of q together with F mk satisfies the descendent property 
at n = rrik + 2fc + 1. 

Lemma 4.23. Suppose p and q are alternating paths, both starting with a non- 
matched edge from the same vertex x and ending in a matched edge vw but from 
different directions. Then there is a subset U C q containing v and w, such 
that for each vertex z £ U there are two alternating paths between x and z of 
different length-parities, lying entirely in U U p, whose total length is at most 
\q\+2\p\-3. 
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Before giving the proof of the lemma, let us show how this completes the 
proof of the claim. It is easy to see that U UF mk satisfies the descendent property 
at time mt + 2k. First of all, by definition, the set F mk itself satisfies it. On the 
other hand for any vertex in U the lemma guarantees the existence of the two 
alternating paths lying entirely in U Up C U UF mk , since p C F mk by induction. 
The sum of the length of these two paths is at most 2a(x) + 2{2k + 2) — 3 = 
2a(x) + 4fc + 1. The age of x at n = nik + 2k is a(x) + 2k and so we are done. 
This completes the proof of the induction step, hence the claim is true. □ 

Proof of Lemma \4-23\ If p and q are disjoint apart from their endpoints, then 
the statement is obvious with U — q, and we even get the stronger upper bound 
|g| + \p\ on the total length of the two paths for any vertex in U. If p and q are 
badly intertwined, we need to be cautious. Let x — ?o> <7i> • - • > <Z2Z = v denote 
the vertices of q. Since both p and q are alternating paths, their intersection is 
necessarily a union of matched edges. For each matched edge <72j-i?2i the path 
p may contain this edge, or not. The ones that are contained in p will be called 
double edges. For each double edge, p may contain it in the same orientation 
as q - these will be called good double edges, or the opposite orientation as q - 
these will be called bad double edges. 

There are two natural partial orders on the set of matched edges of p and q. 
For two such edges e and / will write e < q f if e comes before / on the path 
q. We will write e < p f if e comes before / on p. (If one or both of the edges 
aren't on a given path, they are incomparable in the given order.) Now for any 
matched edge e on g, we define 

Z(e) = min{/ : / > q e}. 

Note that, since the q-maximal edge vw is a double edge, Z(e) is always well- 
defined. Also note that Z(Z(e)) = Z(e). Next, let 

/ = max{e G q : Z(e) = e is a good double edge}, 

and let x' be the vertex of / further away from x. If there is no such double 
edge, then / is not defined, and we just choose x' = x. Let q' be the part of q 
from x' to v, let p' be the part of p between x' and w, and let p" be the part of 
p between x and x' . We claim that U = q' \ p is a good candidate. 

First of all, observe that p"P\q' — x' . When x' — x this is obvious. Otherwise 
it is still true because Z(f ) = /, which means that any edge in q' is visited by 
p later than / is visited by p. Second, take any matched edge e G q'. By 
definition, / < q e. Hence 

f< q e> q Z(e) = Z{Z(e)), 

so by construction Z(e) has to be a bad double edge. Now we can exhibit the 
two alternating paths between x and the edge e. 

From one direction we can simply reach it by going on p" until x' and then 
continuing on q' until we reach e. This is a path, since p" n q' = x' . From the 
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other direction, start at x and go on p" to x' and then further on p' until hitting 
Z(e). Since Z(e) is a bad double edge, we have just visited it in the 'wrong' 
direction on q. So we can now continue on q backwards from Z{e) until we come 
to e. The concatenation of these two segments is still a path, since by definition 
of Z(e), the part of p between x and Z(e) is disjoint from the part of q between 
e and Z(e). The total length of the two paths we have just exhibited is at most 
2\p"\ + \q'\ + \p'\ — 1. The —1 comes from the fact that the vw edge is contained 
in both p and q, but has to be used at most once. Finally \p'\ > 2 thus the total 
length is at most \q\ + 2\p\ — 3. □ 

Now let's look at the connected component of x denoted by X' . It's a (finite 
or countable) connected d-regular c -expander graph with a partial matching. 
Let's remove the edge containing x from the matching. Let S' = {x} and let 
X' k — {x}UFXk- We have already defined the sets Ek that contain all the edges 
leaving XL not ending in B mk , hence in particular containing the once matched 
edge coming out of x. The sets X' k were constructed exactly according to the 
rules of Definition 14.31 Clearly £^1 < d\S'\. But since any odd set, in particular 
X' k , has at least d + 1 edges leaving it, of which at most d is forbidden, the 4th 
assumption of Theorem 14.51 is also satisfied. Thus it applies in this situation 
and implies that as long as x remains tough and |.F n (x)| < |-X"'\.F n (x)|, we have 

iF„( I )i>i™ oW i>i^(i + ^)* > . 

In the countable case the |.F n (x)| < \X' \ F n (x)\ condition is always satisfied 
and \S'\ — 1, while in the finite case it is satisfied as long as the family doesn't 
occupy at least half of the graph, and \S'\ — 1/\X\. Thus in both cases we get 

Corollary 4.24. As long as x remains tough and \\F n (x)\\ < \X\/2, 

2 / 3 \ c (") 

II'.MH* H'^ll *3&(i + 3&) ■ 

where \ \ ■ \ \ denotes the actual size of the set in both the finite and the countable 
cases. 

Definition 4.25. Let us say, that for such moments when n,- < n < rrii for 
some i, the family of x € X is dormant, whereas for moments that satisfy 
rrii < n < m+i the family is active. Let f n (x) denote the number of moments 
m < n such that ||.F m (a;)|| < c^/e and at moment m the family was active, 
where e denotes the ratio of unmatched vertices in X. We will choose C3 = 2 
except in the case when X is finite and e — 2/||X||. In this case we will choose 
c 3 = 1. 

It is clear that f n (x) < c(n) 2 . Note that in the finite case either e > 4/||X|| 
and thus c 3 = 2 and c 3 /e < \\X\\/2, or e = 2/\\X\\ and c 3 /e = \\X\\/2. Hence 
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families that haven't reached the size c^/e are not bigger than half of the graph. 
Hence by 14.241 we have in both the measurable and the finite case that 




2 



fn(x) < 



< c 4 (1 + log 2 1/e) 



(7) 



for a suitably large C4 depending only on the previous constants and d. 



The proof will work similarly to that of Theorem 14.51 but one has to be more 
careful. This time we are interested only in the measurable case, and assume 
that all the sets E% of forbidden edges are empty. Thus Hk is simply the set of 
vertices that can be the end-point of an odd alternating path of length at most 
2k — 1 starting in S. We choose S to be half of set of unmatched vertices. Then 
as soon as we have S n H k ^ or F n X n ^ 0, we have found an augmenting 
path. 



We further reintroduce the notation from the proof of Theorem 14.51 As before, 
we will often drop the index n, when it does not cause confusion. Let TT denote 
the set of tough and TM the set of not-tough vertices within T U S. The tough 
vertices are further classified according to their families. TB denotes the tough 
vertices whose families have size at least c^/e. For tough vertices with smaller 
families, TE shall denote the ones that have active families at the moment, and 
TG denote the ones that have dormant families at the moment. So 



First let's take a tough vertex x S TG whose family is small and dormant. By 
Definition 14 . 2 1 1 this means, that there are at least d+1 edges leaving xUFX c ^ 
that do not end in B n . Let \E(x, FX C ^)\ = k < d. Then there are d — k edges 
leaving x\JFX c r n \ from x. The rest, at least k + 1 must leave from FX c / n y And 
since these edges do not end in B n , they actually have to leave the whole family 
F n {x). The only tough vertex adjacent to the family is x by Corollary 14. 161 so 
the k + 1 edges we have just exhibited must end in H U TM U O. When k < d, 
then (k + l)d/(d + 1) > k. So we have 



4.6 Proof of Theorem 14.21 



Let 




S U T = TM U TT — TM U (TB UTEU TG) . 



E(F(x),TG) 



E(F(x),x)\ < 



d 



\E(F{x),HUTMuO)\. 



d+1 



Integrating over TG we get that 



\E(B,TG)\ < 



d 



E(B,HUTMUO)\ 



d+1 
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For any other tough vertex we bound the number of edges between it and B by 
the trivial bound d. Adding this to the previous equation we get 

\E(B,TT)\ < d\TB\ + d\TE\ + -JL-\E(B,H \JTM \J 0)\ (8) 

Now let us examine the edges running between B n and its complement. By ([5J 
we have 

\E(B,X\B) = \E(B,HUOUTM)\ + \E(B,TT)\ < 

< 2\E(B,HuOUTM)\ + d\TB\ + d\TE\ 



and hence 



E(B,X\B)\ \E(B,HUTMUO)\ 1 „ 

< . , + o(\ TB \ + \ TE \)- 



2(d+l) ~ d + 1 2 

Adding this to ([8j then yields 
\E(B,X\B) 



2(d + l) 



■\E{B,TT)\ < \E(B,HUTMUO)\ + (d+l)(\TB\ + \TE\). (9) 



We know that \T\ — \H\ because of the matching, so the total degrees of 
S U T is d\S\ more than the total degree of H. The edges between TUS and H 
contribute equally to these total degrees. In the worst case there are no internal 
edges in H . This boils down to the following estimate. 

\E(H,0)\ + \E(H,B)\+d\S\ < 

< 2\E(TU S,TU 5)| + \E(TUS,0)\ + \E(TM,B)\ + \E(TT,B)\. 

\E(B X\B)\ 

Adding 2 (d+i) *° both sides, then using ([9]), and subtracting \E(H, B)\ from 
both sides we get 

\E(H, 0)| + ^2(d + i)*^ + d ^ ^ 2 \ E ( T[JS > T[JS )\ + 2\E(B,TM)\+ 

+ \E(BUTUS,0)\ + (d+l){\TB\ + \TE\). 
Adding \E(B UTU5,0)| to both sides implies 

\E(X n , 0)| + + d \S\ < 2\E(T U S, T U S) + 2\E(B, TM)\ + 

+ 2\E{B UTUS,0)\ + {d+l) (\TB\ + \TE\) . (10) 

Any vertex in O n that is adjacent to B n U T n U S is going to be in X n+ \, 
hence 

\E(B n ur„U5, O n )\ < d(\X n+1 \ - \X n \). 
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By definition, any vertex in TM n that is adjacent to an edge coming from B n 
will be part of B n+ i or yield an augmenting path. Also, by Lemma 14.61 any 
edge in E(Sl)T n , SUT n ) has to be adjacent to a point in |-B„+i \ \B n \ or yield 
an augmenting path. This implies that 

2\E(T,T)\ + 2\E(B,TM)\ < 2d(\B n+1 \ - \B n \). 

Plugging all this into (fTUl) we get 

\E(X n ,O n )\ , \E(B,X\B) 



2(d + l) 



\S\< 



^ T^7 - \ X "\ + - l B «D + \ TE \ + \ TB \ ( u ) 

a + 1 

By Definition 14.181 for any vertex x G TE n we get f n+ i(x) = f n (x) + 1, and 
thus 



f n+1 (x)dx = / f n {x)dx + \TE\. 
x J x 

Hence the right hand side of ((TTJ) is at most 2(J(n + 1) - J(n)) + |TB|. Fur- 
thermore by the expander assumption we have 



and 



\E(X n ,O n )\ >co|X„|(l-|X„|) 
\E(B n ,X\B n )\>co\B n \(l-\B n \) 



so from (fTTj) we get 
co|X„|(l- |X„|) co|B n |(l-|B„ 



d+1 2(d+l) 2 



+ |S1 - |TB| < 2(J(n + 1) - J(n)) (12) 



Any vertex in TB has a family of size at least c^/e, and all these are disjoint 
by Claim B~T5l and contained in B. Thus we get that \TB\ < e\B\/c 3 < e/2 = |5| 
in the measurable case and in the finite case when e > 4/||A||. In the finite 
case when e = 2/||A||, then any tough vertex in TB has a family of size at least 
||X||/2, and thus there can be at most one tough vertex. We get \TB\ < \S\ in 
all cases, and thus 

c |A»|(l-|A„|) cqI^KI-I^I) 

2(d+l) + 4(d+l)=» " J( " + 1} " J(n) - (13) 

If we could prove a similar growth estimate on the size of X n (or B n ), 
then the next lemma would imply that X n (or B n ) would grow too large in a 
sufficiently small number of steps, proving the existence of a short augmenting 
path. 
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Lemma 4.26. Let < ao < di < a.2, ■ ■ ■ be an increasing sequence of numbers. 
Let us fix a constant c and say that an index k is good ifak+i — > 2cafc(l — afe) 
holds. Then if the number of good indices up to N is at least 

log!? 2 -) 



then ajv > 1 — ao. 



Proof. Let us split the sequence into two parts. The first part will be where 
afc < 1/2 and the second part where ak > 1/2. 

In the first part if k is a good index then au+i > afe(l + c). Hence au > 
ao(l + c) 9 ^ where g{k) denotes the number of good indices up to k. So if 



log(l + c) 



we must have > 1/2, or in other words h\ already has to be in the second 
part. 

In the second part a good index k implies 1 — a^+i < (1 — <ifc)(l — c), hence 
if is such that 



g(N) = g(ki) + 





< 2 


r io g(^)i 


log(i^) 







then we must have 1 — a at < ao. 



□ 



The problem is that ([13| doesn't directly imply such a growth estimate on 
either X n or B n because a priori the integral term in J„ could absorb any growth 
implied by the inequality. We need one final trick to overcome this difficulty. 
The idea is that we don't need X n or B n to grow the desired amount in one single 
step. If we can find a not so large K such that \X n+ K~ X n \ > 2c(|X„|)(l — |X„|), 
or \B n+ x — B n \ > 2c(|-B„|)(l — \B n \), we are still good. So let us fix some K, 
whose precise value is to be determined later, and assume that 

\X n+K \ - \X n \ < y(|X„|)(l - |X„|) and \B n+K \ - \B n \ < j{\B n \){l - \B n \). 

This means that the growth of J(n) implied by (|13[) has to largely come from 
the J f n term. But note that once a vertex x has a positive /-value, then it has 
to be tough for the rest of its life, until it becomes part of B m for some later 
m, and from that point on its /-value remains constant. Hence if for some x we 
find that f n +i<(x) > fn(x), then either x € B n+ x \ B n , or a; £ TT n+ K- Also 



by (O we know that f n +K{ x ) 

\ fn+K(x)dx 



fn(x) < 04(1 + log 1/e). Hence we get 



I f n {x)dx< c 4 (l + log 2 1/e) [\B V 

X 



VK \B n \ + \TT n+K \). (14) 
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Further it is obvious that \TT n+ i<\ < 1 — \B, 1+ k\ < 1 — \B n \ and since each 
vertex in TT n+ x has a unique, non-empty family inside B n+ K, we also get that 
\TT n+K \ < \B n+K \ < \B n \+c /2\B n \{l - \B n \) < 2\B n \ . Hence we can simply 
write 

\TT n+K \ < 4\B n \(l - \B n \) 

because either \B n \ or 1 — \ B n \ is at least 1/2. We also have by assumption that 
\B n+K \B n \< co/2|fl„|(l - \B n \) < \B n \(l - \B n \). Plugging all this into 
we get 

/ f n+K {x)dx- f n {x)dx <c 4 (l + log 2 l/£)5|B„|(l- \B n \), (15) 
Jx Jx 

and by the assumptions on the small growth of X n and B n we can further 
deduce (assuming C4 is not really small) 

J(n+K)-J(n) < (6c 4 (l+\og 2 l/e))\B n \(l-\B n \) + co/2\X n \(l-\X n \). (16) 

On the other hand we can apply (fT5|) to n, n + 1, . . . ,n + K — 1. By the 
assumption on the small growth of X n and B n during this time, J^ Tl | ( 1 — \X n \) 
and |B n |(l — \B n ) do not change too much either. More precisely we can write 
for any n < m < n + K that \X n \ < \X rn \ and that 1 — |X„ + x| < 1 — \X m \. 
Also 

(l-\X n \)-(l-\X n+K \)<^\X n \(l-\X n \) 

and thus 

1 - \x n+K \ > (1 - ||x„|)(i - \x n ) > IzS. 

Putting all this together we get that 

\X m \(l - \X m \) > \X n \(l - \X n+K \) > ^|X„|(1 - \x n \), 

and the exact same equation holds for B m . Now summing (|13l) for n, n + 
1, . . . , n + K — 1 and using the last inequality, we find that 



K (c Q \X n \{l-\X n \) co|B n |(l-|B n | 



2 V 2(d+l) 4(d+l) 2 



<J(n + K)-J(n) (17) 



Now choose K so large that K > 2(d+l) and c K > 24c 4 (d+l) 2 (l + log 2 1/e)), 
and we clearly have a contradiction between (IT6"|) and (fTTj) . 

Corollary 4.27. This implies that for any n either \X n+ K\ — \X n \ > ^-|X n |(l — 
\X n \) or \B n+K \ - \B n \ > f\B n \(l ~ \B n \). 

Let us consider the sequences a n = \X n K+ no \ , b n — \B n K+n a \- Then Corol- 
lary 14.271 implies, using the language of Lemma 14.261 that every n is a good 
moment for either a n or b n . We know that ao|X„ | = e/2 > e/6. If we also 
knew that 60 = |-B no | > e/8, then by Lemma T4. 261 we could deduce that for 



k = n + 4K 



log(4/e) 



log(4/(4 - co)) 
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we have \X k \ > 1 - e/8 > 1 - e/2 or |B fc | > 1 - e/8 > 1 - e/2, cither of which 
implies the existence of an augmenting path. All we need to do to finish the 
proof of Theorem 14.21 is to exhibit a not too large no for which \B no \ > e/8. 

To this end we prove that as long as \B n \ is very small, the size of X n has 
to increase rapidly. Obviously J f n+ i(x) — J f n {x) < \TE\ < \B n \ since every 
tough vertex has a nonemtpy family. Hence 

J(n + 1) - J(n) < |X n+1 | - \X n \ + ^\B n+1 \. 

If \S\ > i\B n \ then, since clearly \TB\ < \B n \, we also have \S\ - \TB\ > 3|S„| 
and thus by ([T2j) we get 

^_|X„|(1-|X„|)<|X„ +1 |-|X„|. 
Then Lemma 14.261 implies that this cannot hold for more than 

2 r io g (i/ £ ) 

10 g( l- Co /4( d +l) ) 

steps. So this is a good choice for no. The dependence of K on log(l/e) is 
quadratic, of no linear, hence k is of order 0(log 3 (l/e)), the implied constant 
only depending on cq and d. This completes the proof of Theorem 14.21 
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